Claims

1. Method of animating a synthesised model of a human face driven by an audio
signal, comprising an analytic phase, in which an alphabet of visemes is
determined, i.e. a set of information representing the shape of the face of a
speaker corresponding to phonetic units extracted from a set of audio training
signals, and a synthesis phase, in which the audio driving signal is converted
into a sequence of phonetic units associated to respective temporal
information, whereas the sequence of visemes, corresponding to the phonetic
units of the set comprised in the audio driving signal, is determined in the
analytic phase, and the transforms required to reproduce the sequence of
visemes are applied to the model,

characterised by the fact that said analytic phase provides an alphabet of visemes,
determined as active shape model parameter vectors, to which the respective
transforms of the model, expressed as low-level facial animation parameters
compliant with standard ISO/IEC 14496, are associated. During both the
analytic phase and the synthesis phase, the sequences of visemes,
corresponding to the phonetic units of the audio training signal and of the audio
animation driving signal respectively, are transformed into continuous
representations of movement by means of viseme interpolation, conducted as
convex combinations of the visemes themselves to which combination
coefficients, which are functions of time, are associated, the
combination coefficients used in the synthesis phase being the same as
those used for the analytic phase.
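
As an illustration of the interpolation recited in claim 1, the following minimal sketch blends viseme parameter vectors with time-dependent coefficients that are non-negative and sum to one, so that every interpolated shape is a convex combination of the visemes. The bell-shaped weight function, the `sharpness` parameter, the viseme centre times and all function names are assumptions made for illustration only; they are not the coefficient functions referred to in claim 2.

```python
import math

def convex_weights(t, centers, sharpness=50.0):
    """Return one weight per viseme; weights are non-negative and sum to 1.

    centers[i] is the time (in seconds) at which viseme i is centred.
    The bell-shaped kernel is a hypothetical choice, not taken from the claims.
    """
    exponents = [-sharpness * (t - c) ** 2 for c in centers]
    shift = max(exponents)  # keeps the largest term at exp(0) = 1, avoiding underflow
    raw = [math.exp(e - shift) for e in exponents]
    total = sum(raw)
    return [r / total for r in raw]

def interpolate_visemes(t, centers, visemes, sharpness=50.0):
    """Blend viseme parameter vectors as a convex combination at time t."""
    weights = convex_weights(t, centers, sharpness)
    dim = len(visemes[0])
    return [sum(w * v[k] for w, v in zip(weights, visemes)) for k in range(dim)]
```

Under these assumptions, calling `interpolate_visemes` with the same `centers` and `sharpness` in both phases corresponds to using the same combination coefficients for analysis and synthesis.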
2. Method according to claim 1, characterised by the fact that the coefficients of
said convex combinations are functions of the following type:

3. Method according to claim 1 or 2, characterised by the fact that the wire-frame
vertices, corresponding to the model feature points, on the basis of which facial
animation parameters are determined in the analytic phase, are identified and
said viseme interpolation operations are conducted by applying transforms on
feature points for each viseme, for animating a wire-frame based model.

4. Method according to claim 3, characterised by the fact that, for each position to
be assumed by the model in said synthesis phase, the transforms are applied
only to the vertices of the wire-frame corresponding to the feature points and
the transforms are extended to the remaining vertices by means of a convex
combination of the transforms applied to the vertices of the wire-frame
corresponding to the feature points.
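
The extension step of claim 4 can be sketched as follows, assuming that each transform is represented as a displacement vector and that the convex weights are obtained by inverse-distance weighting. Both assumptions, as well as the function names and the `eps` parameter, are illustrative; the claim only requires that the combination be convex.

```python
import math

def inverse_distance_weights(vertex, feature_points, eps=1e-9):
    """Convex weights (non-negative, summing to 1) for one wire-frame vertex.

    Inverse-distance weighting is an assumed rule, not taken from the claims.
    """
    inv = [1.0 / (math.dist(vertex, fp) + eps) for fp in feature_points]
    total = sum(inv)
    return [w / total for w in inv]

def propagate_displacements(vertices, feature_points, feature_displacements):
    """Move each vertex by a convex combination of feature-point displacements."""
    moved = []
    for v in vertices:
        weights = inverse_distance_weights(v, feature_points)
        delta = [sum(w * d[k] for w, d in zip(weights, feature_displacements))
                 for k in range(len(v))]
        moved.append([v[k] + delta[k] for k in range(len(v))])
    return moved
```

Because the weights are convex, a vertex never moves further than the most displaced feature point, and if all feature points share the same displacement every vertex receives exactly that displacement.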

5. Method according to claim 1, characterised by the fact that said visemes are 
converted into co-ordinates of the feature points of the face of the speaker, 
followed by conversion of said co-ordinates into said low-level facial animation
parameters, as described in standard ISO/IEC 14496.
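
The second conversion of claim 5 can be illustrated by a minimal sketch. In ISO/IEC 14496, translational low-level facial animation parameters are expressed in facial animation parameter units, each defined as 1/1024 of a distance measured on the neutral face (for example the mouth width). The function names, the rounding to an integer and the sign convention below are assumptions made for illustration; the standard fixes the positive direction separately for each parameter.

```python
def fapu(neutral_distance):
    """Facial animation parameter unit: 1/1024 of a neutral-face distance."""
    return neutral_distance / 1024.0

def displacement_to_fap(neutral_coord, current_coord, unit):
    """Express a feature-point displacement along one axis in FAP units.

    Positive values here simply mean an increasing co-ordinate (assumed convention).
    """
    return round((current_coord - neutral_coord) / unit)
```

For instance, with a neutral mouth width of 51.2 mm the unit is 0.05 mm, so a 1 mm displacement of a mouth feature point maps to a parameter value of 20.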

6. Method according to claim 5, characterised by the fact that said low-level facial
animation parameters, representing the co-ordinates of feature points, are
obtained by analysing the movements of a set of markers (7) which identify the
feature points themselves.

7. Method according to claim 6, characterised by the fact that the data
representing the co-ordinates of the feature points of the face are normalised
according to the following method:

- a sub-set of markers is associated to a stiff object (8) applied to the
forehead;
- the face of the speaker is set, at the beginning of the recording, to
assume a position corresponding as far as possible to the position of a
neutral face model, as defined in standard ISO/IEC 14496, and a first
frame of the face in such neutral position is obtained;
- for all frames subsequent to the first frame, the sets of co-ordinates are rotated
and translated so that the co-ordinates corresponding to the markers of said
sub-set coincide with the co-ordinates of the markers of the same sub-set in the first




